HIT2 Joint NLP Lab at the NTCIR-9 Intent Task
نویسندگان
چکیده
The report hereby is to represent the principle, the searching process and experiment results. We report our systems and experiments in the intent task of NTCIR 9. The research aims at evaluating the effectiveness of the proposed methods on query intent mining and results diversification in terms of web search. In the subtopic mining subtask, we combine the extracted candidates from search logs and Wikipedia. An improvement could be seen after incorporating query intents from different resources. In the document ranking subtask, greedy algorithms are taken to select documents with the high diversified score and return a re-ranked list of diversified documents based on query subtopics. The experiment results show that the method, that is combining subtopic results directly, outperforms MMR.
منابع مشابه
Using Google Translation in Cross-Lingual Information Retrieval
HIT2 Lab participated in NTCIR 7 IR4QA task. In this task many topics consist of name entities, so Google translation was used to translate query terms because of its high performance on name entity translation. We use KL-divergence model to perform retrieval and Chinese character bigram as our indexing unit. Pseudo feedback was used trying to improve average precision. We achieved competitive ...
متن کاملZSWSL Text Entailment Recognizing System at NTCIR-9 RITE Task
This paper describes our system on simplified Chinese textual entailment recognizing RITE task at NTCIR-9. Both lexical and semantic features are extracted using NLP methods. Three classification models are used and compared for the classification task, Rule-based algorithms, SVM and C4.5. C4.5 gives the best result on testing data set. Evaluation at NTCIR-9 RITE shows 72% accuracy on BC subtas...
متن کاملRMIT and Gunma University at NTCIR-9 Intent Task
In this report, we describe our experimental results for the NTCIR-9 intent task. For our experiments, we use our experimental search engine, Newt. Newt is a ranked selfindex capable of supporting multiple languages by deferring linguistic decisions until query time. To our knowledge, this is the first Information Retrieval task on the ClueWeb09-JA collection performed entirely with ranked self...
متن کاملOverview of the NTCIR-9 INTENT Task
This is an overview of the NTCIR-9 INTENT task, which comprises the Subtopic Mining and the Document Ranking subtasks. The INTENT task attracted participating teams from seven different countries/regions – 16 teams for Subtopic Mining and 8 teams for Document Ranking. The Subtopic Mining subtask received 42 Chinese runs and 14 Japanese runs; the Document Ranking subtask received 24 Chinese runs...
متن کاملUniversity of Glasgow at the NTCIR-9 Intent task: Experiments with Terrier on Subtopic Mining and Document Ranking
We describe our participation in the subtopic mining and document ranking subtasks of the NTCIR-9 Intent task, for both Chinese and Japanese languages. In the subtopic mining subtask, we experiment with a novel data-driven approach for ranking reformulations of an ambiguous query. In the document ranking subtask, we deploy our state-ofthe-art xQuAD framework for search result diversification.
متن کامل